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(57) Abstract 

A continuous stream of formatted digital data, such 
as a video segment, audio segment, or information stream, 
appears to be a fixed length segment under certain circum- 
stances, defining a virtual segment within the continuous 
stream which moves forward in time in synchrony with 
the continuous stream. The virtual segment thus defined 
can be explored in a non-linear fashion at arbitrary play- 
back rates. For instance, concepts such as rewind, pause, 
frame advance, and fast forward become meaningful even 
though the continuous stream never ceases. 
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Method And Apparatus Implementing Random Access 
And Time-Based Functions On A Continuous Stream Of 

Formatted Digital Data 



BACKGROUND OF THE INVENTION 

TECHNICAL FIELD 

The invention relates to digital signal processing. More particularly, the 
invention relates to a family of methods which provide for the pass through or 
capture of linear streams of digital infomnation represented in various fomiats. 

DESCRIPTION OF THE PRIOR ART 

With the advent of consumer audio and video products that employ 
sophisticated digital signal processing techniques it is becoming necessary to 
find ways to exploit the M\ potential of digital technology. For example, il 
would be desirable to provide methods for the pass through or capture of 
linear streams of digital infomiation represented in various fomiats, while at the 
same time providing the appearance to the consumer of a locally stored 
digital stream that allows for the repositioning and playback of virtual 
segments of the apparently local digital stream. 

A mechanical device which performs some of these functions is the Video 
Cassette Recorder (VCR), which uses a magnetic tape to store the 
information. The inherently linear nature of tape leads to functions such as 
rewind, fast forward, and pause. However, a VCR cannot both capture and 
play back infomnation at the same time, so it cannot be used to implement this 
capability. 

Linear streams of information are a fixture of modem life. Consider broadcast 
radio stations, broadcast television stations, satellite broadcasts, cable 
television, video tapes, and compact disks. Increasingly, such infomnation is 
represented in a fashion suitable for manipulation by automated electronic 
hardware, such as computers or media decoders. For example, the Direct 
Video Broadcast (DVB) standards address digital broadcasting from 
satellites, terrestrial stations, and cable television systems. Even analog 
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broadcasts, such as nomial N^SC (National Television Standards 
Committee) broadcasts from familiar local stations, may be captured and 
digitized in real time by modem equipment, making them appear to be linear 
digital streams. 

Though such streams never terminate, and an individual viewer of the stream 
is unable to effect directly how such streams are delivered, it is desirable to 
provide the fliusion for the consumer that recent portions of the stream are 
stored locally in some manner, such that typical VCR-like functions can be 
performed on the stream, e.g. pause, rewind, and fast fonward. The desire for 
this capability arises from the fact that the schedule and timing of the broadcast 
almost never matches the needs of the individual viewer. For instance, the 
viewer may wish to stop the stream for a few moments to discipline an unruly 
child. Or perhaps the viewer's attention was distracted from the stream for a 
few moments, causing him to miss a critical scene, in which case the viewer 
would like to rewind to the point he missed and play it again. 

Ideally, a device local to the viewer should capture the entire stream as it is 
being broadcast and store it in some manner. For example, if two video tape 
recorders are available, it might be possible to Ping-Pong between the two. 
In this case, the first recorder is started at the beginning of the program of 
interest. If the viewer wishes to rewind the broadcast, the second recorder 
begins recording, while the first recorder is halted, rewound to the appropriate 
place, and playback initiated. However, at least a third video tape recorder is 
required if the viewer wishes to fast fonward to some point in time after the 
initial rewind was requested. In this case, the third recorder starts recording the 
broadcast stream while the second is halted and rewound to the appropriate 
position. Continuing this exercise, one can quickly see that the equipment 
becomes unwieldy, unreliable, expensive, and hard to operate, while never 
supporting all desired functions. In addition, tapes are of finite length, and may 
potentially end at inconvenient times, drastically lowering the value of the 
solution. 

It is possible to implement this capability using a digital computer, where 
digital streams are stored in some fashion analogous to video tape and where 
the computer performs the switching between the various virtual tape decks. 
Even using a digital computer, this strategy suffers from the same 
weaknesses as the physical system above. It would be desirable to avoid 
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these issues by providing a technique for storing the streams of infomiation 
on a temporary basis. 



When using a digital computer to perform any technique which achieves this 
functionality, there are a number of issues which must be taken into account for 
proper operation. The first of these is storage of the broadcast stream. Within 
a digital computer, a stream of infomiation is represented as a sequence of 
blocks of digital data. For example, when encoding an NTSC television 
broadcast stream, each field of analog data is converted to a block of 8-bit 
digital samples representing the field. If the analog signal is faithfully 
represented, each digital block contains approximately 0.5 MB of data, one 
second of video requires approximately 30 MB of storage, and 30 seconds 
of video requires approximately 900 MB of storage, greater than the 
capacity of a compact disc. Manipulation of video in this form clearly becomes 
unworkable when any useful length of stored video is contemplated. 

As an example, consider U.S. patent no. 5.625,46, which concerns the use 
of a magneto-optic disk for the storage of broadcast television transmissions. 
The amount of storage available on such media is currently about 5 to 10 
gigabytes, which is sufficient for approximately 5 seconds of video storage - 
clearly insufficient. In addition, the device disclosed does not perniit the 
simultaneous recording and playback of the same program. 

Limited storage capacity is dealt with by compressing the video stream using 
an algorithm, typically one of the MPEG (Moving Pictures Experts Group) 
standard algorithms, which can achieve a useful compression of 100:1 n 
many instances. MPEG video is represented as a sequence of Groups Of 
Pictures (GOPS), in which each GOP begins with an index frame, called the I- 
frame. The l-frame is a block of digital data which is compressed using 
Discrete Cosine Transfomi (DCT) and other techniques, similar to the still- 
picture Joint Photographic Experts Group (JPEG) standard. 

The GOP may represent up to 15 additional frames by providing a much 
smaller block of digital data that indicates how small portions of the l-frame, 
referred to as macroblocks, move over time. Thus, MPEG achieves ifs 
compression by assuming that only small portions of an image change over 
time, making the representation of these additional frames extremely 
compact. 
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Unlike the uncompressed data example above, or examples based on 
video tape recording, each frame is thus represented as a variable length 
block of binary data. Additionally, although GOPs have no relationship 
between themselves, the frames within a GOP have a specific relationship 
which builds off the initial l-frame. Thus, any method which stores a digitized 
stream and allows random access to the stored infomnation must take into 
account the variable (and unpredictable) data sizes involved, as well as be 
cognizant of the relationships between blocks of the stream. 

A second issue for a digital computer based implementation of such methods 
is that multiple streams of infomiation must be handled in parallel. For 
example, a broadcast stream is actually composed of at least two unique 
sequences of information, i.e. a stream of digital blocks representing the visual 
image and a stream of digital blocks representing the audible image. If the 
audio effect is instead stereo, then two audio streams are included, each 
unique. A broadcast signal may have additional data, such as the Secondary 
Audio Program (SAP), where the stream of information is a translation of the 
audio signal to a different language. Another stream which may be present is 
the Closed Caption (CC) stream, which provides a textual representation of 
spoken language in the audio stream(s). The simple broadcast stream 
described eariier may therefore have at least five different components, each 
one compressed using different techniques. When presenting this complex 
stream to a viewer; the blocks of each stream must be decoded at 
appropriate times for the compression methods involved and synchronized 
with the presentation of all other streams. 

Also of interest are digital broadcasting technologies, such as DVB. A DVB 
channel is fomied in an MPEG2 Transport Multiplex, which is an encoding 
scheme that provides for interieaving any number of discrete streams of 
digital information into a single stream of digital data, using techniques based 
on Time Division Multiplexing (TDM). The example television signal above 
can be encoded into a DVB channel using five discrete streams, leaving 
additional capacity for other streams. 

There is increasing interest in adding additional infonrcition streams to a 
standard broadcast signal. For instance, it may be desirable to transmit audio 
channels in several different languages in parallel with the video stream. Or, 
pertiaps infomiation that is interpreted as a Web page is broadcast oi such a 
way as to be synchronized with the video to provide a multimedia 
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presentation. The number of streams which must be synchronized may be 
arbitrary, and each stream may be represented using different and unique 
storage and compression techniques which have their own synchronization 
requirements and inter-frame relationships. 

Any methods which provide functionality similar to that described above 
using some form of digital computer must contain techniques which resolve 
these issues. 



The methods and apparatus described herein implement a novel and unique 
facility that provides for the manipulation of a continuous stream of digital 
information as if it supported random access and variable rate presentation on 
demand by the consumer of the stream. The possible operations on the 
data stream are limited only by the storage capacity and lO bandwidth of the 
system which implements the apparatus and methods, whether that storage 
be volatile, such as DRAM, or non-volatile, such as hard disk storage, or a 
combination of both. The apparatus and methods support operations, 
sometimes referred to as virtual VCR functions, such as pause, rewind, fast 
fonward, and play, as well as more sophisticated and unique operations, such 
as play faster, play slower, and play in reverse: Additionally, the apparatus 
and methods described herein provide a mechanism whereby such 
operations are instantaneous from the point of view of the consumer of the 
data stream, unlike in mechanical systems. The apparatus and methods also 
support the ability to capture portions of the data stream which are 
temporarily held by the apparatus, such that this captured data can be stored 
or sent over a networi< in formats suitable for presenting the data to other 
consumers, or for editing and manipulation by otiier tools or metiiods. 

The invention provides a method and apparatus for providing pass through 
or capture of continuous finear streams of digital information represented *n 
various formats while providing the appearance of a locally stored stream. 
The prefen'ed embodiment of the invention comprises at least one media 
cache for copying blocks of data from tiie information stream. Data in tiie 
media cache can be viewed as a snapshot of the continuous stream of digital 
infonnation. The invention also comprises a playback pointer. The playback 
pointer position selects a portion of the media cache that is to be accessed to 
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provide functions inducfing any of pause, rewind, fast fonward, play, play 
faster, play slower, and play in reverse. 



ng. 1 is a scfiematic representation of a continuous media buffer according to 
the invention; 

Fig. 2 is a block schematic diagram of a system incorporating a continuous 
media buffer according to the invention; 

Fig. 3 is a timing diagram showing the operation of a Bnear cache according to 
the invention; 

nETAILED DESCRIPTION CiF THE INVENTION 

The invention contemplates a new and unique apparatus for transfomiing and 
manipulating continuous streams of digital information using the methods and 
techniques described herein. 

Figure 1 provides a general oven/iew of how the prefen-ed embodiment of 
the invention operates on a continuous stream of data. In particular, one or 
more media caches 10 copy blocks of data from an infomiation stream 12. 
The position of a playback pointer 14 selects a portion of the media cache 
that is to be accessed, thus allowing such functions as pause, rewind, fast 
fonward, and play, as well as more sophisticated and unique operations, such 
as play faster, play slower, and play in reverse. 

The apparatus is independent of the format of the underlying digital data 
stream. To apply the invention, it is only necessary that the underlying data 
stream is composed of a number of discrete blocks of data, and that only a 
small set of attributes are associated with each block. The data blocks 
themselves may be of arbitrary size and format. It is assumed that processes 
specialized to a data source or to a decoding operation may proceed freely 
within the context of the invention, indirectly controlled by the methods 
described herein. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 2 illustrates the prefen-ed embodiment of this invention. While the 
example in Figure 2 is provided in connection with an MPEG stream of 
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information, it should be appreciat^ that the invention is readily applied to 
any digital stream of infomnation. 

Manipulation and transformation of a data stream is perfonmed in the invention 
by a combination of the following elements: 

1. The Buffer Controller (201) is the main management device. It 
accepts external requests (generated, for instance, from a remote control 
device) for operations on the digital stream and, in turn, generates appropriate 
control messages for the other devices of the invention; 

2. The Stream Clock (202) provides a general device for synchronizing 
operations on a set of linear caches, such thai multiple streams of data which 
must be presented in a synchronized fashion are correctiy positioned, and 
that they present tiieir data at consistent delivery rates; 

3. The Unear Cache (204) is a general device for buffering tiie 
information contained in a stream of digital infomiation, such that tiie data in the 
cache can be viewed as a snapshot of the continuous stream of digital data; 
and 

4. The Clip Capture (203) device is a general mechanism for capturing all 
or part of a cached set of infomnation streams and presenting the resulting data 
as an organized stream of data to a data sink such as, for example, a 
computer file system or another display device. 

The following sections describe each of the foregoing components, and how 
ttiey interact, in detail. 

BUFFER CONTROLLER 

The Buffer Controller (BC) module is responsible for Instantiating the 
methods that provide for operation of the invention for any particular stream of 
data. The preferred embodiment of the invention deludes one Stream Clock 
(SC) module, zero or more Clip Capture (CC) modules, and one or more 
Unear Cache (LC) stream modules. Operation of the invention preferably 
relies on designating one of tiie LC streams as the key stream. All 
positioning of playback from the LCs is based on first positioning the key 
stream, and then using the final position of tiiat stream to position ail otiier 
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Streams properly. For example. If the key stream is an MPEG video stream, 
then positioning is based on aligning the cunent blocl< pointer to the beginning 
of an MPEG Group Of Pictures (GOP). Once the key stream is positioned, 
the BC repositions all other streams to be synchronized with the key stream. 

The BC is also responsible for controlling the dock rale of the Stream Clock, 
which operates as described below. The rate at which the SC delivers dock 
events to the Linear Caches is based on the rate specified by the BC. Thus, 
the BC can speed up or slow down the playback of the entire collection of 
streams by requesting the SC to adjust its dock rate. 

Finally, the BC is responsible for initiating dip capture operations using the 
Clip Capture method. 

STREAM CLOCK 

The Stream Clock provides a central synchronization fadlity that distributes 
time-based events to a number of LCs. Each stream of information in a 
broadcast program is encoded using different techniques, each technique 
having unique notions of. for example, block size, data format, and 
presentation time stamps for decoding. For instance, in the time it takes for a 
single MPEG video frame to be decoded and presented, there may be 
several MPEG audio frames to be decoded and presented. The Stream 
Clock distributes events to each LC at the proper rate for that LC. 

The Stream Clock uses a single queue for managing events to be distributed 
using standard, prior art techniques that are femiiliar to those skilled in the art of 
building time-based software systems. 

Operation of the Stream Clock is detailed below. 

Events are not queued with an absolute time, but with the relative offset from 
the previously queued event. The first event in the queue is marked with a 
time which is the offset from the current time. The Stream Clock then requests 
that the underlying operating system invoke a call out when that amount of 
time has passed, e.g. the SC queues a timer event. When the caD out is 
invoked, the SC dispatches the first dock event and then each succeeding 
event whose offset is zero, i.e. each event that is to occur at the same time as 
the previous event. 
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Each event in the queue is also maiked with a time interval and the absolute 
time of the data stream that the next event should occur. Atter a queued 
event is dispatched, the absolute time of the next event is composed by 
adding the time interval to the previous absolute time. The difference 
between this time and the current absolute time detemiines the delay before 
this event should be repeated. The event is then added to the queue at the 
appropriate posrtion for the event's next occurence. This strategy insures that 
even if some delay occurs in dispatdiing the event, following events occur at 
the proper time. After afl events with a offset of zero have been processed, 
the SC queues a new timer event at the offset to the first waiting event, after 
which tiie above process repeats. 

The decoupling of tiie absolute clock value and the actual dispatching of dock 
events Is critical in implementing some of ttie unique and novel aspects of ttie 
invention, e.g. the ability to cortrol easily ttie rate at which playback of tiie 
stream occurs. When Initially created, ttie SC records a value referred to as 
ttie rate multiplier, which is initially set to one. Whenever tine SC requests ttie 
underlying operating system to queue a timer event, ttie actual time delay 
requested is multiplied by tiie rate multiplier. A rate multiplier greater ttian one 
results m faster playback of the stream, while a multiplier of less ttian one 
results in a slower playback of tiie stream. 

For the LCs to position themselves properly (see below), the same rale 
multiplier must be used to modify the Presentation Time Stamp (PTS) stored 
in each block when calculating positioning. Thus, ttie SC makes a small set of 
functions available to the LC whldi perfonm comparisons and operations on 
PTS values, after properly synchronizing those values witfi tfie rate multiplier 
stored in the SC. 

The combination of the Stream Clock, wrth Independent event generation for 
each LC and tiie ability to modify logically ttie rate at which aD events are 
dispatched using a single value provides a novel and unique application of 
common time-based software design techniques. 
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THE LINEAR CACHE 

A key component of the invention is the Unear Cache (LC). The LC is used 
to store a single stream of digital infomiation h proper sequence. The LC 
maintains this stream as a list of the digital blocks that make up the stream, and 
assumes that each block has a unique size and attributes. Each block within 
the stream is mari<ed with a Presentation Time Stamp (PTS) which indicates 
when that block should be presented to the decoding process, be it 
hardware or software. The PTS is a monotonically increasing value initialized 
to zero when the LC first begins operation on a stream, and is not related to 
any time stamps defined by the underlying stream encoding technique 
except that once the LC begins operation, there is a constant offset between 
the PTS stored in the LC and any time stamps stored within the encoded 
digital blocks. It is assumed that the decoding process properly handles 
embedded time stamps. Figure 3 gives a pictorial overview of the preferred 
embodiment of the Linear Cache. 

The capture mechanism for a particular stream type gives each encoded 
digital block to the LC as it anives (301). The LC marks that block with the 
current PTS for the stream. The LC maintains a window (302), which is 
defined as the maximum time, based on PTS values, between when the 
newest block in the LC arrived and the oldest block which the LC may hold. 
Thus, the window represents a time span into the past history of the stream. 
The LC discards blocks which fall outside the window (303). thus the window 
allows one to look a fixed distance into the past history of a stream. This 
mechanism allows choices and tradeoffs between available storage space 
and the availability of past information for viewing. 

The capture mechanism is responsible for providing certain attributes along 
with each new block (304). The first of tiiese is the key frame attribute, which 
indicates thai this block begins a sequence of intenelated blocks. When 
performing random access operations on a stream, the LC only allows 
positioning of the sti-eam to a block merited as a key frame. The second is the 
End Of Segment (EOS) attribute, which indicates that tiie stream has ended, 
and no more data are to be presented. For example, the l-ft^me of an 
MPEG GOP is mari<ed as a key fi^me by the capture mechanism, but aH 
ottier blocks are not so merited. 
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The LC maintains an indication of the next block to be presented to the 
decoding process, which is referred to as the current block indicator (305). For 
. nonnal viewing, this block is the block last added to the LC by the capture 
mechanism. Thus, the stream is presented live, with at most a one-frame time 
delay between capture and presentation. 

Random access to the infomiation stream is achieved by moving the current 
block indicator to some other block in the LC. If the LC is requested to move 
the indicator, and the cunent block indicated Is not a key frame, the LC instructs 
the decoding process to reset its decoding state, thus purging any partially 
constmcted presentation data. For example, if the current frame is an MPEG 
predictive frame (P-frame), then the decoding process has state information, 
including the cunBnt picture to be modified by the P-frame. This data must be 
purged so that the decoder begins in the proper state. 

Referring again to Figure 2. the BC requests repositioning of an LC by 
specifying a PTS value. The LC finds the block containing a PTS which is 
closest to that requested by the BC. There are two unique cases for 
repositioning: the firet. if the LC is marked as the key stream, and the second 
when it is not. 

If the LC is marked as a key stream, the LC scans the blocks in the cache to 
find the key frame wNch is nearest to the requested PTS value, searching 
both before and after the desired value. Once properly positioned, the LC 
returns to the BC the PTS of the key frame block which was identified. 

Following positioning of the key stream, the BC instaicts each remaining 
stream to position itself to the PTS retumed by the key-stream. Key frames h 
other LCs may not align with those in the key stream. Each LC handles the 
problem by requesting the decoding process to purge any decoding state, 
and then stores an indication that suppresses the LC from actually presenting 
blocks to the decoding process. 

Following this, the LC behaves normally, accepting dock events and 
advancing the current block indicator to match, except friat the blocks are not 
actually presented to the decoding process. When the LC encounters a key 
frame, it deletes the indication suppressing presentation of blocks, and 
presents the current block to the decoding process. This brings the stream 
into full synchronization with the key stream. 
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The following subsections describe in detail how positioning operations are 
handled by the LC. 



Forward 

The forward function is implemented by moving the current block indicator 
fonward through the cache by one block for each event generated by the 
Stream Clock. Each successive block in the cache is given to the decoding 
process for presentation as dock events are dispatched by the SC. A rate 
multiplier of one causes the current block indicator to move forward one block 
in the LC at the same rate as new blocks are added to the LC by the capture 
process. Conceptually, the curent block indicator points to the same position 
in the cache, while the window of cached blocks moves beneath iL From the 
perspective of the consumer of a stream, this results in play out at a fixed 
delay from real time. 

There exists a rate at which the decoding process may not be able to 
present a block to the consumer before decoding of the next block must 
begin, i.e. the decoder can not keep up with the playback rate requested. 
The LC recognizes this case, and reverts to a key frame based method, 
where intermediate blocks are skipped and only key frames are presented to 
the decoder. 

The current block indicator can not be positioned to an uncached block. If the 
rate multiplier indicates a rate greater than one, then the current block indicator 
eventually moves to the frunt of the cache. In this case, the key frame LC 
indicates to the BC that no more blocks are present in the cache. The BC 
resets the SC rate multiplier to one, and each LC positions the current block 
indicator to the latest cached block. The effect of this operation is to revert to 
undelayed play out of the actual infomnation stream. 



Reverse 



The reverse function is implemented by moving the current block indicator 
backwards through the cache by one block for each clock event generated b y 
the SC. Again, the key stream LC is used to control positioning. The BC 
instructs each LC to move to reverse mode, in which the current block 
indicator is moved backwards one block on each dock event. In addition, this 
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mode causes the LC to suppress presentation of all but key frames to the 
decoding process. When the cunent blocl< indicator is moved over a key 
frame, the block is presented to the decoding process. 

The rate at which blocks are presented to the decoding process is controlled 
by the rate multiplier h the Stream Clock, allowing for arbitrary speed of 
reveree operation. 

The current block indicator can not be moved past the oldest block in the 
cache. If a dock event would result in moving the current block indicator past 
the earliest block, then the indicator is set to that block, and play out continues 
as described above. The key frame LC indicates to the BC that the oldest 
cached block was reached. The BC resets the SC rate multiplier to one, and 
each LC positions the current block indicator to the oldest cached block. 
Externally, it appears as if the stream began playing in a forward direction 
again, time delayed by the window size. 



The BC implements the pause function by locking the cunent block indicator h 
the key stream LC to that block. The LC can only tock the indicator to a key 
frame block, thus it searches forward for such a block h the cache. If no key 
frame is present, the LC stores an indication that a lock has been requested. 
When the capture process presents a key frame, the LC thus recognizes that 
the lock was requested, and locks the cunent block indicator to that key frame. 
The LC also presents this key frame to the decoding process, such that 
proper positioning, from the decoding process point of view, is maintained. 
Following this, as new blocks are added to the front of the cadie, the block 
indicated moves backwards in the cache. Additionally, the LC sets the 
indicator suppressing further presentation of data to the decoding process. 

For each non-key-stream LC, the BC simply suppresses further presentation 
of data blocks to the decoding process. The BC may be requested to 
release the pause {i.e. to play the stream). In this case, the BC obtains the 
PTS of the current block (as modified by the SC rate multiplier) from the key 
stream LC, and directs eadi remaining LC to position appropriately as 
described above. It then releases the lock on each LC, resulting n 
presentation of blocks to the decoding processes. 



Pause 
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tt Is possible that the window moves past the block to wKch the Indicator is 
locked because capture of the incoming stream e continuous. In this case, the 
indicator is unlocked and the lett pointing at the earliest block n the cache. This 
gives the external effect of automatic release of the pause, similar to the 
behavior of a VCR, which releases the pause to avoid excessive wear on 
the tape and playback heads. 

Get Clip 

This function is used to extract a range of blocks from the cache. There are two 
modes in which it operates: relative mode and absolute mode. 

As discussed above, the cached blocks appear as a linear list of blocks n 
presentation order. The gtetcllpO function returns a pointer to a cached blpck, 
plus a count of the number of blocks that make up the range. This Information 
is referred to as a dip. For efficiency sake, the pointer returned directly 
indicates a cached block, and the following blocks are retrieved by following 
the next block Onk in each -cached block. This list is only valid If the first block 
pointed to and afl succeeding blocks remain cached until the caller has 
completed its operation on the clip. 

The LC has the following- behavior to avoid invalidating blocks which ate part 
of a clip: the getclipQ function sets an indicator in the LC that suppresses 
discarding blocks which fall outside the window. This indicator is cleared when 
the releaseclipO funclron- is called. The releaseclipO function also trims the 
cache to the window size.- 

The result of this sequence is that the number of blocks in the LC may 
temporarily increase. It is the responsibility of the caller of the getclipO function 
to operate on the clip as rapidly as possible because storage space for 
cached blocks is a limited resource. However, this strategy allows a clip to be 
captured without interfering with the ongoing playback of blocks from the LC, 
which avoids presenting tiie consumer of the blocks with any artifacts of the 
capture operation. 

In relative mode, the clip to be captured is specified as two numbers, the first 
representing an offset from the current block Indicator into previous blocks h 
the cadie, and the second representing an offset from the cunnent block 
indicator Into subsequent blocks in the cache. The offset in both cases is 




W099/52279 ^ ^ PCr/US99/04921 

expressed as a time differential from the PTS stored h the cunenl block. For 
instance, a capture operation might occur from five seconds before the cunent 
block to ten seconds after the cuitent block. No uncached blocks may be 
indicated by the getcIipQ operation. Thus, the LC trims the actual clip returned 
to match the blocks available in the cache. 

In absolute mode, the dip to be captured is specified as two numbers, the 
firet representing an absolute time stamp based on when the LC began 
operation, and the second representing a time offset from that value, which 
may be a negative offset (indicating a distance backwards in the cache) or a 
positive value (indicating distance fonwards in the cache). The clip is returned 
as described above, and trimmed as necessary. This also implies that 
getclipO can return an empty list of blocks if no block in the cache is within the 
time window described. 



CLIP CAPTURE 

The Clip Capture module is responsible for selecting ranges of blocks from a 
set of LCs. bringing them together into an appropriate storage or 
transmission fomiat, and saving or sending them as needed. This collection of 
blocks from multiple LCs is also referred to as a clip. The Clip Capture 
operation is invoked by the BC on external request. 

There are two ways in which this request may be made: 

Rrst. the BC may indicate that capture should occur based on the current 
block indicator in the key stream LC, whidi is referred to as a relative capture, 
in which case a relative range of capture Is specified. 

Second, the BC may indicate an absolute range, in which case only blocks 
marked with a PTS inclusive in that range are captured. This is refen-ed to as 
an absolute capture. 

The operation of the Clip Capture module is straightfonward. For each LC, the 
CC module calls the getclipO function of that LC with parameters as directed 
by the BC. The Clip Capture module then linearizes the blocks into a 
multiplexed stream appropriate for the final clip format desired. For example, 
it might generate an MPEG System Stream, interieaving blocks from the 
various LCs as needed. 
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"TTiis interieaving Is conceptually simple. The CC loops through each dip 
obtained from an LC. choosing the block with the earliest PTS from among aS 
clips, outputting the block as appropriate, and advancing the block pointer for 
that clip to the next block. When all the blocks from all dips are exhausted, the 
capture operation is complete. The CC then loops through all LCs calling the 
releasedipO function. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appredate that other 
applications may be substituted for those set forth herein without departing 
from the spirit and scope of the present invention. Accordingly, the invention 
should only be limited by the Claims included below. 
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CLAIMS 



1. A method for providing pass through or capture of continuous Onear 
streams of digital information represented in various formats vi/hile providing 
the appearance of a locally stored stream, comprising the steps of: 

providing at least one media cache for copying blocks of data from 
said information stream, wherein data in said media cache can be viewed as a 
snapshot of said continuous stream of digital information; 

providing a playback pointer, wherein a playback pointer position 
selects a portion of said media cache that is to be accessed to provide 
functfons including any of pause, rewind, fast forward, play, play faster, play 
slower, and play in reverse; and 

wherein said underiying infomiation stream can be composed of a 
plurality of discrete blocks of data. 

2. The method of Claim 1 . further comprising the step of: 

providing a buffer controller that accepts external requests for 
operations on said information stream and that, in turn, generates appropriate 
control messages; and 

wherein said buffer controller instantiates methods thai provide for 
system operation in connection with any particular stream of infomiation. 

3. The method of Claim 1 , further comprising the step of: 

providing a stream dock for synchronizing operations on said more 
than one media cache, wherein multiple streams of data which must be 
presented in a synchronized fashion are correctly positioned, and wherein 
said multiple streams of data present their data at consistent delivery rates; 



wherein said stream clock provides a cerrtral synchronization facility that 
distributes time based events to a number of media caches. 

4. The method of Claim 3, further comprising the step of: 
designating one of said multiple streams as a key stream; 
wherein all positioning of playback from each media cache is based on 

first positioning said key stream, and then using a final position of that stream 

to position all other streams property; 

wherein positioning is based on aligning a current block pointer to the 

beginning of an MPEG Group Of Pictures when said key stream is an MPEG 

video stream; and 



and 
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Wherein a buffer controller repositions ail other streams to be 
synchronized with said key stream once said key stream is positioned. 

5. The method of Claim 3, further comprising the step of: 

providing a buffer controller that is responsible for controlling a dock 

rate of said stream clock; 

wherein the rate at which said stream clock delivers dock events to said 
media cache is based on a rate spedfied by said buffer controllen and 

wherein said buffer controller can speed up or slow down playback of 
an entire collection of streams by requesting said stream dock to adjust its 
dock rate. 

6. The method of Claim 3, wherein events are queued with a relative 
offset from a previously queued event; 

wherein a first event in said queue is marked with a time which is an 
offset from a current time; 

wherein said stream dock requests that an undertying operating 
system invoke a call out when that amount of time has passed; and 

wherein said stream dock dispatches a first dock event and then each 
succeeding event whose offset is zero when said call out is invoked. 

7. The method of Claim 1, wherein said media cache maintains said 
stream as a list of the digital blocks that make up said stream, and assumes 
that each block in said stream has a unique size and attributes; 

wherein said attributes indude any of a key frame attribute which 
indicates that said block begins a sequence of inter-related blocks such that 
said media cache only allows positioning of said stream to a block mari<ed as 
a key frame when performing random access operations on said stream; and 
an End Of Segment (EOS) attribute which indicates that said stream has 
ended and that no more data are to be presented; 

wherein each block within said stream is mari<ed with a Presentation 
Time Stamp (PTS) which indicates when that block should be presented to a 
decoding process; 

wherein said PTS is a monotonically increasing value initialized to zero 
when said media cache first begins operation on a stream and is not related to 
any time stamps defined by an undertying stream encoding technique, 
except that there is a constant offset between said PTS stored in said media 
cache and any time stamps stored within said encoded digital blocks once 
said media cache begins operation; 
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wherein a capture mechanism for a particular stream type gives each 
encoded digital block to said media cache as it anives and provides certain 
attributes along with each new blodq 

wherein said media cache marks said encoded digital block with a 

cument PTS for said stream; 

wherein said media cache maintains a window which is the maximum 
time, based on PTS values, between the time when a newest block in said 
media cache has anived and an oldest block which said media cache may 
hold, such that said window represents a time-span into a past history of said 
stream; 

wherein said media cache maintains an indication of a next block to be 
presented to a decoding process; and 

wherein said next block is a block last added to said media cache by 
said capture mechanism, such that said stream is presented live with at most a 
one-frame time delay between capture and presentation. 

8. The method of Claim 7, wherein random access to said stream is 
achieved by moving a current block indicator to another block in said media 
cache; and 

wherein said media cache instmcls a decoding process to reset its 
decoding state, thus purging any partially constmcted presentation data, if said 
media cache is requested to move said block indicator, and a cunent block 
indicated is not a key frame. 

9. The method of Claim 7. further comprising the step of: 

providing a buffer controller that requests repositioning of a media 
cache by specifying a PTS value; 

wherein said media cache scans blocks of data in said media cadie to 
find a key frame which is nearest to a requested PTS value, searching both 
before and after a desired value, if said media cache is marked as a key 
stream, wherein said media cache retums PTS of said key frame block which 
was identified to said buffer controller, once said media cadie is properly 
positioned; 

instructing each remaining stream to position itself to said PTS returned 
by said key stream, wherein key frames in other media caches may not align 
with those in said key stream, in which event each media cache requests a 
decoding process to purge any decoding state, and then stores an indication 
that suppresses each other media cache from actually presenting blocks to 
said decoding process; 
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wherein said media cache accepts dock events and advances a current 
block indicator to ntatch. except that said blocks are not actually presented to 
said decoding process; 

deleting an indication suppressing presentation of blocks; and 
presenting a current block to said decoding process when said media 
cache encounters a key frame. 

10. The method of Claim 1. wherein a fonward/reverse function is 
implemented by moving a cun^nt block indicator foPA^ard/reverse through said 
media cache by one block for each event generated by a stream clock; 

wherein each successive block in said media cadie is given to an 
optional decoding process for presentation as dock events are dispatched 
by said stream clock; 

wherein a rate multiplier causes a current block indicator to move 
fonward/reverse within said media cache at a selected rate as new blocks are 
added to said media cache by a capture process, resulting in play out at a 
fixed delay from real time; 

wherein said media cache reverts to a key frame based method ii 
which intermediate blocks are skipped and only key frames are presented to 
said decoder if said decoder can not keep up with a playback rate requested; 

wherein a buffer controller resets said stream dock rate multiplier to 
one, and each media cache positions a current block indicator to a latest 
cached block if said current block indicator moves to a front of said media 
cache where no more blocks are present in said media cache; 

wherein said current block indicator is set to an earliest block in said 
media cache if a dock event would result in moving a current block indicator 
past said earliest block; and 

wherein play out ttiereafter continues and that it appears as if said 
stream is playing in a fonward direction again. 

1 1 . The method of Claim 1 , further comprising the step of: 

providing a buffer controller for implementing a pause function by 

locking a oirrent block indicator in a key stream media cache to a key frame 

block nearest a currently indicated block; 

searching fonft^ard for said key frame block in said media cache if said 

current block is not a key frame block; 

storing an indication that a lock has been requested if no key frame is 

present; 
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locking said cunenl block Indicator to a key frame when a capture 
process presents a key frame; and 

wherein said indicator is unlocked and left pointing at an eariiest block h 
said media cache if a stream window moves past a block to which said 
indicator is locked, thereby providing automatic release of said pause. 

1 2. The method of Claim 1 , further comprising the steps of; 

extracting a range of blocks from said media cache in either of a relative 
mode and an absolute mode by returning a pointer to a cached block plus a 
count of the number of blocks that make up an extract range; and 
providing an indicator in said media cache that suppresses discarding blocks 
within said extraction which fall outside a window. 

13. The method of Claim 1 . further comprising the step of: 

providing a clip capture module for selecting ranges of blocks from a 
set of media caches, bringing said blocks together into an appropriate storage 
or transmission format, and saving or sending said blocks as needed; and 

wherein a buffer controller indicates where a capture should occur 
based on a current block indicator in a key stream media cache. 

1 4. An apparatus for providing pass through or capture of continuous linear 
streams of digital infomiation represented in various formats while providing 
the appearance of a locally stored stream, comprising: 

at least one media cache for copying blocks of data from said 
infomiation stream, wherein data in said media cache can be viewed as a 
snapshot of said continuous stream of digital information; 

a playback pointer, wherein a playback pointer position selects a 
portion of said media cache that is to be accessed to provide functions 
including any of pause, rewind, fast fonward, play, play faster, play slower, 
and play in reverse; and 

wherein said underlying information stream can be composed of a 
plurality of discrete blocks of data. 

1 5. The apparatus of Claim 14, further comprising: 

a buffer controller that accepts extemal requests for operations on said 
information stream and that, in turn, generates appropriate control messages; 
and 

wherein said buffer controller instantiates methods ttiat provide for 
system operation in connection with any particular stream of information. 
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1 6. The apparatus of Claim 1 4, further comprising: 

a stream dock for synchronizing operations on said more than one 
media cache, wherein multiple streams of data which must be presented in a 
synchronized fashion are conrectty positioned, and wherein said multiple 
streams of data present their data at consistent delivery rates; and 

wherein said stream clock provides a central synchronization facility that 
distributes time based events to a number of media caches. 

1 7. The apparatus of Claim 1 6, further comprising: 

a module for designating one of said multiple streams as a key stream; 

wherein all positioning of playback from each media cache is based on 
first positioning said key stream, and then using a final position of that stream 
to position all other streams property; 

wherein positioning is based on aligning a current block pointer to the 
beginning of an MPEG Group Of Pictures when said key stream is an MPEG 

video stream; and 

wherein a buffer controller repositions aB other streams to be 
synchronized with said key stream once said key stream is positioned. 

1 8. The apparatus of Claim 1 6, further comprising: 

a buffer controller that is responsible for controlling a dock rate of said 
stream dock; 

wherein the rate at which said stream dock delivers clock events to said 
media cache is based on a rate specified by said buffer controller, and 

wherein said buffer controller can speed up or slow down playback of 
an entire collection of streams by requesting said stream dock to adjust its 
dock rate. 

19. The apparatus of Claim 16, wherein events are queued with a relative 
offset from a previously queued event; 

wherein a first event in said queue is mari<ed with a time whidi is an 
offset from a current time; 

wherein said stream dock requests that an undertying operating 
system invoke a call out when that amount of time has passed; and 

wherein said stream dock dispatches a first dock event and then each 
succeeding event whose offset is zero when said call out is invoked. 
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20. The apparatus of Claim 14, wherein said media cache maintains said 
stream as a list of the digital blocks that make up said stream, and assumes 
that each block in said stream has a unique size and attributes; 

wherein said attributes indude any of a key frame attribute which 
indicates that said bloc* begins a sequence of inter-related blocks such that 
said media cache only allows positioning of said stream to a block mari<ed as 
a key frame when performing random access operations on said stream; and 
an End Of Segment (EOS) attribute which indicates that said stream has 
ended and that no more data are to be presented; 

wherein each block within said stream is marked with a Presentation 
Time Stamp (PTS) which indicates when that block should be presented to a 
decoding process; 

wherein said PTS is a monotonically increasing value initialized to zero 
when said media cache first begins operation on a stream and is not related to 
any time stamps defined by an undertying stream encoding technique, 
except that there is a constant offset between said PTS stored in said media 
cache and any time stamps stored within said encoded digital blocks once 
said media cache begins operation; 

wherein a capture medianism for a particular stream type gives eadi 
encoded digital block to said media cache as ft amves and provides certain 
attributes along with each new block; 

wherein said media cache marks said encoded digital block with a 
current PTS for said stream; 

\Mierein said media cadie maintains a window wh'ch is the maximum 
time, based on PTS values, between the time when a newest block in said 
media cache has arrived and an oldest block which said media cache may 
hold, such that said window represents a time-span into a past history of said 
stream; 

wherein said media cache maintains an indication of a next block to be 
presented to a decoding process; and 

wherein said next block is a block last added to said media cache by 
said capture mechanism, such that said stream is presented live with at most a 
one-frame time delay between capture and presentation. 

21 . The apparatus of Claim 20, wherein random access to said stream is 
achieved by moving a current block indicator to another block in said media 
cache; and 

wherein said media cache instructs a decoding process to reset its 
decoding state, thus purging any partially constructed presentation data, if said 



PCTAJS99/04921 



WO 99/52279 



24 



media cache is requested to move said block indicator, and a cunent block 
indicated is not a key frame. 

22. The apparatus of Claim 20, further comprising: 

a buffer controUer that requests repositioning of a media cache by 

specifying a PTS value; 

wherein said media cache scans blocks of data in said media cache to 
find a key frame which is nearest to a requested PTS value, searching both 
before and after a desired value. If said media cache is marked as a key 
stream, wherein said media cache returns PTS of said key frame block which 
was identified to said buffer controller, once said media cache Is properly 
positioned; 

a module for instructing each remaining stream to position itself to said 
PTS retumed by said key stream, wherein key frames in other media caches 
may not align with those in said key stream, in which event each media cache 
requests a decoding process to purge any decoding state, and then stores 
an indication that suppresses each other media cache from actually presenting 
blocks to said decoding process; 

wherein said media cache accepts clock events and advances a cunent 
block indicator to match, except that said blocks are not actually presented to 
said decoding process; 

a module for deleting an indication suppressing presentation of blocks; 

and 

a module for presenting a cunBnt block to said decoding process when 
said media cache encounters a key frame. 

23. The apparatus of Claim 14, wherein a forward/reverse function is 
implemented by moving a cunent block indicator fonward/reverse through said 
media cache by one block for each event generated by a stream clock; 

wherein eadi successive block in said media cache is given to an 
optional decoding process for presentation as dock events are dispatched 
by said stream clock; 

wherein a rale multiplier causes a current block indicator to move 
fonward/reveree within said media cache at a selected rate as new blocks are 
added to said media cache by a capture process, resulting in play out at a 
fixed delay from real time; 

wherein said media cache reverts to a key frame based method r 
which intermediate blocks are skipped and only key frames are presented to 
said decoder if said decoder can not keep up with a playback rate requested; 
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Wherein a buffer controller resets said stream dock rate multiplier to 
one. and each media cache positions a current block incficator to a latest 
cached block if said current block indicator moves to a front of said media 
cache where no more blocks are present in said media cache; 

wherein said am&nt block indicator is set to an earliest block In said 
media cache if a dbdk event would result in moving a current block indicator 
past said earliest block; and 

wherein play out thereafter continues and that it appears as if said 
stream is playing in a fonvard direction again. 

24. The apparatus of Claim 14, further comprising: 

a buffer controller for implementing a pause function by locking a 
current block indicator in a key stream media cache to a key frame block 
nearest a currently indicated block; 

a module for searching fonvard for said key frame block in said media 
cache if said cument block is not a key frame block; 

a module for storing an indication that a lock has been requested if no 

key frame is present; 

a module for locking said current block indicator to a key frame when a 
capture process presents a key frame; and 

wherein said indicator is unlocked and left pointing at an earliest block h 
said media cache if a stream window moves past a block to which said 
indicator is locked, thereby providing automatic release of said pause. 

25. The apparatus of Claim 14, further comprising: 

a module for extracting a range of blocks from said media cache h 
either of a relative mode and an absolute mode by retuming a pointer to a 
cached block plus a count of the number of blocks that make up an extract 
range; and 

an indicator in said media cache that suppresses discarding blocks 
within said extraction which fall outside a window. 

26. The apparatus of Claim 14, further comprising: 

a clip capture module for selecting ranges of blocks from a set of 
media caches, bringing said blocks together into an appropriate storage or 
transmission format, and saving or sending said blocks as needed; and 

wherein a buffer controller indicates where a capture should occur 
based on a current block indicator in a key stream media cache. 
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Where required, 
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